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METHOD AND SYSTEM FOR TESTING AND CHARACTERIZING 

FOOD PRODUCTS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application is a continuation-in-part of U.S. patent application no. 

09/639,690, filed on August 16, 2000, which claims priority to U.S. application no. 
60/171,004, filed on December 15, 1999. 

BACKGROUND 

[0002] The present invention relates to bacteriological testing and to food 

processing. 

[0003] Increasing publicity surrounding outbreaks of foodborne illness, and 

political pressure on the part of consumer groups have intensified the need for developing 
effective methods for detecting microorganisms in food and or eliminating them from the 
food chain. In the current climate of public awareness, a single outbreak of foodborne 
illness or even the identification of a contaminated product can be devastating. While 
pre-harvest and post-harvest intervention strategies are ultimately the best places to 
address microbial contamination, solutions at these levels will require long-term research 
commitments and may require implementation of substantial changes to the operation of 
food-growing enterprises. On the other hand, development of efficient testing 
methodologies may provide near-term solutions for reducing the amount of contaminated 
product that reaches the market. Despite the importance of this problem, state of the art 
testing methods remain limited in scope, are labor intensive, and are incapable of 
providing real-time information. The primary obstacles to fast and convenient testing are 
imposed by biophysical constraints on the system and sampling. In the absence of 
approaches to overcome these constraints in the near future, we are left with the question 
of how to upgrade current testing methodologies. 

[0004] One clearly desirable goal for microbial monitoring systems would be the 

development of rapid methodologies capable of high throughput and broad application. 



Atty. Docket No.: 101997-19 



However, constraints on the biochemical and biophysical processes that govern the 
sensitivity and specificity of many state of the art tests must be overcome before 
advances in real-time detection can be made. Currently, the technology for doing so is 
rudimentary at best, works only in pure solutions, and is generally not cost-effective. 
Testing is therefore generally completed only after-the-fact and monitoring of already 
packaged, sterilized food results in the recall of entire batches from the shelves. 

[0005] Even though improvements in real-time detection technologies are not 

immediately forthcoming, applicant has realized that certain other forms of testing, 
together with the predictive power achievable with computational analysis of the test 
results, may achieve significant advances over the currently available tests and test 
methodologies. 

SUMMARY OF THE INVENTION 

[0006] The present invention provides methods and apparatus for predicting 

sensory, physical, ingredient, or processing properties of a selected food product by 
comparing the microbial profile of that food product to a database that correlates known 
sensory, physical, ingredient, or processing properties to an output distribution of 
identified microorganisms. More specifically, the present invention correlates known 
sensory, physical, ingredient, or processing properties (e.g., color, taste, texture, pH, 
moisture content) of a food product with a spectrum of target species of microorganisms 
that are present in the food product. The information relating to the known property of 
the food product and the output distribution of microbial populations of that food product 
are stored in a database which can be mined to correlate the known property with the 
output distribution, so that certain target properties of a food product can be predicted by 
comparing the microbial profile of the food product to this database. 

[0007] One or more of the foregoing ends are achieved in accordance with the 

present invention by a system that employs a multispecies DNA microarray to test for 
presence or amount of a plurality of organisms, their metabolites or characteristic 
sequences. A prepared sample is applied to an array of predefined probes to detect a 
multispecies distribution, and a data processing module reads the array output. The 
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output distribution is stored, with process parameters, history or other data about the 
sample, in a database. Preferably the module or the system further includes an 
informatics module that processes the array data, by look-up or correlation, to produce 
information of diverse types, which may relate to a sensory property such as color, size, 
shape, viscosity, clarity (opacity), smell, consistency, texture or taste; to a physical 
property such as pH, redox potential, moisture content, preservative content, shelf-life, 
and acidity; to processing conditions, quality or source of a component or ingredient; or 
to potential pathogenicity or other factor. The factors may be stored as extrinsic 
parameters, or developed as derived correlations, or both. In general, the system of the 
present invention may be seen as combining a number of recent advances in methodology 
for high throughput genetic testing and informatics systems to provide a new food testing 
procedure and equipment, which are configured such that a single test provides 
quantitative and qualitative information about the population distribution of microbial 
species in the tested sample or food product. 

[0008] The method constitutes a rapid and sensitive testing procedure, and 

provides a volume and utility of information that add significant value to microbial 
testing extending far beyond the simple and conventional testing for the presence or 
absence of a pathogenic microorganism. The value-added parameters may include shelf- 
life, correlations between microbial profiles and characteristics such as ingredient quality, 
flavor potential, or the likelihood of carrying undetectable or otherwise difficult-to-detect 
organisms. The system itself readily produces new correlations of this type which may 
then be exploited for correcting process history or conditions. The information 
acquisition and processing module operates on the database to allow its utilization and 
mining, and provides a mechanism for specialization of the data to specific food products 
or other microbial environments, as well as to the incorporation or development of 
company or trade secrets. Thus, the system upgrades product testing methodology and 
revolutionizes thinking about food testing. 

[0009] As noted above the invention employs multispecies test arrays and a data 

recording system, preferably with a data mining informatics system. The multispecies 
array is a test array in some respects similar to the arrays employed in high throughput 
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genetic testing to sequence or identify genetic material, and in accordance with the 
invention, it is configured to detect specific genetic material of plural species or strains of 
organisms. The informatics system processes information on the distribution of target 
species detected by the array, and preferably their relative abundance to identify one or 
more parameters of food pathogenicity, quality, or correlated processing conditions or 
control parameters of interest. Examples of processing conditions include time- 
temperature combinations for heating and/or cooling, time-intensity combinations for 
irradiation, rate of cooling, pressure, and rate of evaporation. 

[0010] The overall operation may be implemented in an integrated and 

automated system, and may be optimized for each intended area of application by 
appropriately predefining sets of target DNA sequences that are diagnostic for sampled 
organisms, and identifying representative population profiles or associated processing 
conditions. In addition to such testing systems, the scope of the invention includes a 
testing service, which may, for example, utilize a database derived from many sources 
that includes output records for many different types of species, and operates to identify a 
range of potential organisms and/or processing parameters from the database of output 
records which may lie outside the conventionally known parameters of the process being 
studied. 

[0011] In general, applicant contemplates a system wherein a multispecies array 

provides a simple test of a food sample that in one processing sequence provides 
information on the presence and/or quantity of organisms or identifying products thereof. 
While a basic system involves sampling, preparing the sample, and then labeling target 
products and detecting the labeled products with a test array, greater sensitivity may be 
obtained by optional process steps of culture amplification before nucleic acid extraction 
and labeling, PCR amplification of nucleic acids. The array is read by fluorescence 
detection and image analysis, and systems may be assembled from a number of off the 
shelf components, together with a set of easily made DNA probes, assays or markers that 
are specific to the organisms or materials of interest. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0012] These and other features of the invention will be understood from the 

description herein, taken together with figures illustrating representative embodiments 
and features of the invention, wherein: 

[0013] Figure 1 illustrates a multispecies test array useful in the practice of the 

invention; 

[0014] Figure 2 illustrates different PCR approaches useful in the practice of the 

invention; 

[0015] Figure 3 shows a system of the present invention; 

[0016] Figure 4 illustrates layout of addresses and content in one embodiment of 

an array; 

[0017] Figure 4 A shows detection results in one embodiment of an array at 

different species concentrations; 

[0018] Figure 5 shows a fluorescence image identifying DNA segments which 

hybridized to labeled probes on a genome array following the procedure of Example 2; 
and 

[0019] Figure 6 shows a fluorescence image identifying DNA segments which 

hybridized to labeled probes on a multispecies array following the procedure of Example 
3. 

DETAILED DESCRIPTION OF THE INVENTION 

[0020] In contrast to traditional methodologies that test for a single organism or 

group of organisms at a time, the present invention tests a sample to provide an output 
distribution representing populations of multiple microbial species in the sample. The 
sample will be discussed in detail and illustrated for a food sample below. 
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[0021] A basic embodiment of the invention combines highly specific DNA 

probes capable of identifying many species, e.g., tens or hundreds of different 
microorganisms, into a single testing matrix, and each specific probe is located at a 
determined address or position of the matrix. The matrix, which is herein termed a 
multispecies array, allows the entire microbial flora present in a food sample to be 
evaluated and monitored. As described below, the process of testing food samples with 
the multispecies array may require a 12-24 hour period to complete the test, but its output 
provides a multidimensional distribution with added predictive power that outweighs its 
lack of real-time performance. The advantages include increased sensitivity, enhanced 
product safety, enhanced ability to understand and predict shelf-life, and an increased 
ability to predict the effects of ingredients, formulations, and processing on the properties 
of the product, such processing properties including, for example, time-temperature 
combinations, rate of evaporation during drying or dehydration, and time-intensity 
combinations in irradiation. By using multispecies arrays and creating a database of 
results, one not only identifies contaminated samples, but also sets out or measures a full 
set of parameters for understanding the relationships between microbial populations in 
food and processing or formulation parameters. Because the makeup of the microbial 
flora can dramatically influence shelf-life and organoleptic or sensory properties, 
multispecies arrays can also provide value added information regarding the effect of 
specific populations on properties such as quality and shelf life parameters of a given 
food. The invention is equally applicable to environmental and workplace testing, where 
multiple species depend in complex ways on surrounding parameters. 

The Probe Arrays 

[0022] The invention employs a specially- constructed probe array to test for 

microbial gene sequences or products. Probe arrays, perhaps one of the greatest 
revolutions in biological methods since the development of the Polymerase Chain 
Reaction (PCR) methodology, rely upon the ability to fabricate small surfaces containing 
many different DNA probes at specific addresses so that the detection of labeled material 
at those addresses indicates the presence of the specific materials to which the probes are 
directed in the original material. This has provided enabling technology for high- 
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throughput genetic testing, because rather than detecting a single gene or a group of 
genes, DNA probe arrays can be fabricated to simultaneously profile vast portions of the 
genome of an organism. Thus, the expression pattern of each gene of a given organism 
can be simultaneously monitored under different growth conditions. The present 
invention applies this approach not to drug discovery and understanding of diseases, but 
to monitoring microbial populations in a sample, illustratively food. The method and 
system of the invention may also be applied to clinical, industrial, epidemiological and 
environmental samples. Rather than an array of probes from the genome of a single 
species, the multispecies testing arrays of the invention have probes arranged at addresses 
specific for many different microbial species, and may include different strains and 
specific gene sequences within species to be found in the sample. With an array of one 
thousand probes, for example, one can test for one hundred different organisms and 
employ ten different probes for each organism. Thus, detection and quantification of any 
single organism may be based on multiple data points. This feature offers the prospect of 
detection of the presence of an organism in cases where the detection threshold of a 
conventional assay is too high to permit effective detection or quantification. Thus, the 
array provides greater information about the tested sample. 

[0023] As described further below, applicant contemplates that the multispecies 

arrays of the present invention are preferably sampled by an automated array reader, and 
their data outputs are preferably mined, correlated and developed into new, 
multiparameter tests for diverse measures of food quality and process conditions. The 
amount and usefulness of information obtained from testing multispecies arrays is 
effectively limited only by the creativeness of the array design and by the "data-mining" 
power of the software which analyzes the data. In the context of food samples, applicant 
contemplates fabrication of arrays having probes specific for all known bacterial 
pathogens that could be carried in the intended food sample, as well as spoilage and 
commensal organisms. Such an array would therefore enable evaluation in a single test 
of not only the safety of a product, but also parameters that contribute to its shelf- life and 
organoleptic properties, or sensory properties such as color, size, shape, viscosity, clarity, 
smell, consistency, texture, and flavor. 
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v Value-added With Test Results From Probe Arrays 

[0024] The traditional approach of food testing has been to examine the sample 

for the presence of a single species of pathogenic microorganism or group of indicator 
organisms such as coliform bacteria. Even state of the art testing based on genetic 
identification is designed to detect a single target gene from a single species or closely- 
related groups of microorganisms. However, by designing a suitable multispecies testing 
array, and by configuring an informatics system to present data from the array, the 
invention effectively screens entire microbial communities in a single test. By providing 
a correlation module to correlate information about the population distribution of 
microorganisms with any factor or other extrinsic data, the distribution output may be 
extended to yield other useful information. Such extrinsic factors may be present as 
fields or associated records in the database, indicating, for example, a processing 
condition (e.g., time-temperature combinations, rate of evaporation, rate of cooling, 
pressure characteristics, and time-intensity combinations for irradiation), food component 
source (e.g., supplier, origin, genotype, and period of growth), or simply the array output 
(i.e., test results of multiple organisms in the array). 

[0025] Several example applications of correlating output distribution include the 

detection of dangerous conditions by identification of new indicator organisms 
correlating with the presence of difficult to detect organisms or conditions; detection of 
quality-related conditions, for example, by correlation between particular species and the 
shelf-life of a product or its organoleptic properties; and correlation between specific 
ingredients (e.g., the specific suppliers of an ingredient, formulations of ingredients in a 
product, the origin of the ingredient (e.g., geographic location or growth range), the 
variety or genotype of the ingredient (e.g., cultivar of wheat), the date or period of growth 
(e.g., vintage)) and the presence or growth of problematic microorganisms. These 
correlations may be determined automatically, i.e., discovered by data mining programs 
that automatically access, update, and correlate information in a database. The database 
may be generated from array output distributions for the tested samples, together with 
extrinsic information such as conventional records of batch lot numbers, ingredient 
invoices, product returns, later-reported or later-detected contaminants, testing/validation 
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methods used, and nutrient content. The correlations so derived, and the database and 
testing criteria that are established in this manner in a product processing line, thus form a 
database that, over time, may constitute a valuable proprietary trade secret. 

[0026] By way of example, one value-added piece of information obtainable from 

the invention's multispecies array testing methodology may be the finding that the 
texture of a particular food product becomes slimy due to the production of extracellular 
slime by a species of spoilage bacteria such as Alcaligenes eutrophus. The present 
invention allows such an observation to be carried further. By maintaining an historical 
database of test results with other observed parameters, one may then, using stored 
testing array output distributions, discover preconditions and causative factors. For 
example, database correlations may show that A. eutrophus is commonly present, but that 
it only synthesizes the slime when growing in the same microenvironment as certain 
species of Pseudomonas. Observation of this correlation would lead to a conclusion that 
A. eutrophus presents a practical problem when Pseudomonads are also present in high 
numbers. Several such examples of extracellular compounds or byproducts from one 
species of microorganism impacting the physiology of a second species have been 
reported in scattered microbiological studies in the prior art. By adopting a multispecies 
array testing methodology, the discovery of further relationships of this type may be 
automated, and follow naturally from correlation of output distribution test data from the 
multispecies array testing of food products. 

[0027] A second example of the power of applying data mining to the 

multispecies array distribution records is the identification of microbial profiles among 
ingredients that are indicators of potential problems with subtle sensory or organoleptic 
properties of the food. In this instance, data from testing ingredients used in production 
of processed vacuum-packaged meat products may demonstrate, for example, that an 
off-flavor is consistently reported for batches when the recorded distributions for those 
batches show that certain species of the genus Carnobacterium are present in the 
ingredients. Further mining would then allow correlation of test results with the source 
of the tested foods to allow discovery of which suppliers provided ingredients containing 
those specific species or strains of Carnobacterium, or which processing conditions 
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permit its growth, and therefore may be more likely to produce off flavor. For example, 
the present invention can be used to determine the time-temperature combinations that 
are not likely to result in the extermination of this species of microorganism. 

[0028] In addition to identifying new correlations between microbial populations 

and product properties, the multispecies arrays by repeatedly determining the spectrum of 
species in a fixed food, process or environment also enable the determination of new 
correlations between detectable species that can then be used as highly-specific indicator 
organisms. For example, while coliform bacteria have been used classically as indicator 
organisms, their presence is not necessarily a sufficient indicator of fecal contamination, 
since some coliform species can also be found naturally on plant material. This makes 
their potential as an indicator for fecal contamination on raw ingredients less useful. 
With the use of a multispecies array and data-mining informatics, software can 
automatically record the species present in food samples and determine which specific 
organisms are associated with pathogens. Thus, the array miner may "discover" that 
when the specific coliform profile contains primarily species of Enterobacter, there is 
poor correlation with the presence of enteric pathogens such Salmonella enterica or 
Shigella species. On the other hand, the informatics software might also determine, by 
"discovering" a correlation, that when the coliform bacterium Klebsiella oxytoca is 
present along with Citrobacter fruendi, there is a much higher probability that the 
coliform bacteria came from a source where enteric pathogens could be present. The 
invention utilizes this type of multi-component correlation to implement new screening 
or testing methodologies. With such an indicator or test criterion, then, when a testing 
array scores positive for K. oxytoca and C.fruendi, one pulls the product aside and tests 
more thoroughly for the presence of low levels of Salmonella and other enteric 
pathogens. Furthermore, new arrays may then be fabricated to detect the specific species 
required for the newly-identified screening tests. 

[0029] Although each of the above examples is hypothetical, they illustrate 

biologically reasonable approximations of the type of information present in array test 
data, which is simply and automatically derived by data mining, and they illustrate the 
range of substantive knowledge and new testing protocols of the invention. These 
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examples specifically illustrate the value-added component that multispecies arrays and 
data mining provide, as compared to a conventional test for the simple presence or 
absence of specific pathogens. Thus, the key elements of the approach are the 
configuration and fabrication of the testing array, the data mining method, and its 
extensions to provide specificity, depth, and proprietary databases of test data, 
correlations and derived knowledge. 

System Operation 

[0030] An overview of steps of the present invention is illustrated in Figure 1 and 

is described below. 

[0031] The multispecies array testing system of the present invention is 

preferably implemented as a completely automated system, with high-throughput 
potential and low cost per sample. A basic embodiment of the system prepares a food 
sample and tests for a palette or distribution of organisms or genetic material 
characteristic of the set of organisms by labeling material prepared from the sample and 
reading the labeled material in each of a plurality of wells or positions of an array. The 
process may also be tailored to detect organisms with higher sensitivity by first culturing 
the sample with a culture medium cocktail to simultaneously multiply several different 
cell populations before extracting DNA for array testing. It is understood, however, that 
the culture amplification step is optional and that the extracted nucleic acid can be 
directly labeled without amplification. The multispecies array can include target 
sequences that are species-specific, and selected for either efficient amplification as a 
group or for their uniqueness (e.g., ability to be differentiated from genes in other 
species). 

[0032] Typically, the entire testing procedure, with the culture amplification step, 

may take approximately 8-12 hours per sample and can include five steps: i.) a culture 
step (e.g., 6-8 hours) designed to increase a subgroup of target populations of 
microorganisms; ii.) a sample preparation step which may include extraction and 
concentration of nucleic acid from the cultured organisms; iii.) a target amplification and 
labeling step in which the polymerase chain reaction (PCR) is used to amplify a plurality 
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of different targets and label the amplification products with fluorescent compounds; iv.) 
a hybridization step in which the labeled amplification products are allowed to hydrogen 
bond to their respective target probes on the multispecies array; and v.) a detection step in 
which the fluorescence intensity of each probe address on the multispecies array is 
detected. In an alternative exemplary procedure, the target labeling step can be 
performed without amplification in which the entire DNA complement extracted from the 
culture can be labeled by random priming or direct chemical attachment of the label to 
the DNA. It is understood that the present invention can utilize a variety of species 
identification techniques, including non-PCR techniques. Therefore, while the examples 
described herein utilize PCR for convenience, the present invention should not be limited 
by such use. Preferably the process is automated, although it may also be performed by 
manual culturing and sample preparation. 

[0033] One embodiment of a process in accordance with the present invention 

using a multispecies array is shown in Figure 1 for testing a food sample. In a first step 
1, the food sample is inoculated into a culture medium and incubated for a sufficient time 
to permit multiplication of the cells. Different species of microorganisms are represented 
by different shading in Figure 1. The Figure illustrates the process schematically with a 
single test tube as a culture vessel, and preferably the culture process is designed to 
simultaneously culture all, or many, of the target species simultaneously. However, the 
sample may also be divided and placed in two or more vessels that present different 
culture conditions, so that each vessel is effective to simultaneously culture a subgroup of 
the targeted number of species. 

[0034] In the second step 2, nucleic acids are extracted (as indicated by the 

squiggly line) and are mixed with PCR reagents or random priming reagents, preferably 
including fluorescently-labeled PCR primers. In the Figure, arrows with black stars 
represent the fluorochrome. 

[0035] In the third stage 3, enzymatic amplification by the polymerase chain 

reaction (PCR) is carried out with labeled probes to increase the number of segments of 
the genomes from each of the target organisms, with each amplified segment containing a 
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fluorescent label. In the Figure, target genes amplified from the microorganisms present 
in the original food sample are shaded differently to indicate different species from which 
they originated. In a variation of this step, amplification may be performed with 
unlabeled probes, and a chromophore or other label is added during the amplification or 
subsequently. For example, to effect subsequent labeling, biotinylated probes may be 
used, thus retaining the PCR products on a column, with a subsequent step of eluting the 
amplified biotinylated strands onto the array, and applying labeled antibodies to the 
immobilized material. Another post-synthetic labeling method may include chemical 
cross linking of fluorochromes onto different moieties of the amplified DNA chains. 
Lastly, the extracted DNA can also be labeled without prior amplification through use of 
random priming, direct biotinylation, or chemical labeling. 

[0036] Continuing with a description of Figure 1, after the described 

amplification/labeling of step 4, the fluorescently-labeled PCR amplification products are 
allowed to form specific hydrogen bonds with complementary sequences located on a 
probe array 50. The array 50 is constructed such that each individual address, 
represented in the Figure by a rectangle on the probe array, contains a different gene 
probe, and such that within a given row 51a, 51b .. .of the array, all rectangles or sites 
contain probes specific for gene sequences of a single species or strain. Although the 
probes for each species are grouped together in this illustration, it is understood that the 
different probes for a given species can be positioned at any location on the array. Thus, 
the fluorescently-labeled PCR amplification or random priming products will form 
hydrogen bonds specifically with the complementary probe from the row and column 
corresponding to the species and gene respectively from which it originated. 

[0037] Finally, in step 5, fluorescence detection and image analysis identify the 

addresses on the multispecies array which fluoresce and, in some embodiments, may also 
quantify the degree of detected fluorescence. The probe addresses testing positive are 
indicated in black in Figure 1 . 

[0038] Implementation of the foregoing steps in done as follows. 



-13- 



Atty. Docket No.: 101997-19 



Culture or Other Amplification of Target Organisms 

[0039] Because food matrix compounds can have profound inhibitory effects on 

PCR amplification, bacterial cells are separated from the food matrix before performing 
PCR. Systems of the invention may employ two basic approaches to achieve this. The 
first approach separates of the target cells or their nucleic acids from the food matrix, for 
example using microfluidics devices. Systems employing microfluidic sampling and 
preparation may be optimized to provide favorable performance in terms of real-time 
assays or speedy performance. In this approach, a microfluidics system is programmed 
and prepared with the necessary reagents for automated sampling and purification of 
microorganisms and their nucleic acids from the sample. In the second approach, the 
cells are multiplied by elongation and division in culture, so they increase logarithmically 
over time. Most foodborne organisms of interest have doubling times ranging from about 
thirty minutes to one hour under favorable conditions. Thus, a six hour culture step 
would allow six to twelve doublings of the population. Because of varying nutritional 
requirements of different species, differences in growth rates (causing faster growing 
cells to overtake the culture), and the presence of process-injured cells, it may in some 
cases be difficult to devise a single medium that permits efficient growth of all target 
organisms. In that case, one may provide a few different types of media to enrich the 
populations of different species or groups of species. 

Nucleic Acid Extraction 

[0040] After collection of the microbial cells from the food sample and lysis of 

the cells to release the nucleic acids, the nucleic acids are concentrated by precipitation or 
by adsorption to a solid matrix such as hydroxyl apatite, glass, diatomaceous earth, or 
plastic, and PCR amplification or random priming (described below) is performed on the 
immobilized genetic material. The extraction step may be performed manually in a 
prototype by simply adding chaotropic agents to the culture, followed by pouring the 
mixture through a small sample preparation column or cartridge containing an adsorption 
matrix. The column cartridge may then be washed and placed into a device for PCR 
amplification or random priming labeling. Preferably, the system of the invention 
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employs a special cartridge configured for automated implementation of several 
subsequent steps. In this aspect of the invention, the culture or food sample is poured 
into a cartridge that fits into a fluidics system having automated fluidics pumps that then 
operate to perform cell lysis, nucleic acid extraction, washing, and PCR amplification or 
random priming labeling within the cartridge. There are currently several prototype 
fluidics systems on the market and in Federal laboratories that function on the 
meso-fluidics (milliliter volumes), micro-fluidics (microliter volumes) and nano-fluidics 
(nanoliter volumes) scales. For food testing, applicant contemplates an automated system 
of the invention as a meso- or micro-fluidics system with an integrated system controller 
that is set or programmable to pump the appropriate sequence of solutions through the 
cartridge. An existing commercial sequence labeling system may be augmented to add 
such PCR capability by providing a thermal jacket or other heating mechanism around 
the cartridge, with additional controller functions to provide thermal cycling to 
implement PCR incubations for amplification. 

PCR Amplification and Fluorescent Labeling 

[0041] Implementation of this step involves selection and optimization so as to 

enhance the sensitivity and specificity of the test. The PCR amplification step consists of 
using short oligonucleotide primers as substrates for enzymatic amplification of different 
target segments of the genomes of organisms present in the culture. Probes 
corresponding to these target segments are also provided on the array 50. In the 
embodiment illustrated in Figure 1, each PCR primer is labeled with a fluorochrome. 
After amplification by the PCR process, the fluorescent amplification products contain 
enhanced levels of the target sequences that are complementary to probes on the 
multispecies array which are specific to the strains or species of interest for which the 
array has been configured. 

[0042] Implementing optimization of this amplification and labeling step is 

carried out by selecting and evaluating fluorescent primer combinations so as to identify 
those which most effectively amplify the desired target sequences. Applicant 
contemplates two general approaches that may be used: amplification of the specific 
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target sequences, or amplification of a variety of segments of the chromosome including 
those which are complementary to the target probes. As illustrated in Figure 2, this 
involves making or selecting sets of gene-specific primers that are capable of specifically 
directing enzymatic amplification of each target as shown in panel A of Figure 2, or 
selecting sets of primers that direct amplification of multiple segments of the genome, 
only some of which contain gene sequences that are complementary to probe addresses 
on the multispecies array, as shown in panel B. Yet a third approach is to use short 
sequences that are common to sets of target genes as sites for primers. Each of these 
approaches may be used. The first method offers the prospect of enhanced specificity or 
sensitivity. As noted above, the invention also contemplates methods wherein the DNA 
is fluorescently labeled during or after amplification; this may allow optimization of the 
amplification step with fewer constraints, or permit a greater signal level to be achieved 
to further optimize an array and processing system. 

[0043] Figure 2 illustrates the two different methods for PCR amplification and 

fluorescent-labeling of the products. Enzymatic amplification by the PCR process occurs 
between the PCR primers, shown as small rectangles with arrows, that are directed to 
particular positions in the chromosome, which is depicted as a large circle. Portions of 
the chromosome having gene segments that are targeted by the multispecies array are 
indicated by large shaded rectangles. For simplification, only one chromosome from 
only a single species is represented. In panel A of Figure 2, each specific target gene is 
amplified by a highly specific set of primers that direct enzymatic amplification only of 
segments from the genome that contain genes being targeted by the multispecies array. 
In Panel B, the same target genes are amplified; however relatively non-specific PCR 
primers are used, and they amplify several segments of the genome, which include the 
target genes but also include other segments that are not targeted. 

[0044] The determination whether a system of the invention uses PCR primers 

that are fluorescently labeled, or else incorporates fluorescent labels into DNA chains 
produced during the PCR amplification process (either concurrently with or following 
DNA synthesis) may depend on factors that vary with the particular combination of 
species targeted with the array. One advantage of using labeled primers is that 
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fluorescent primers can easily be synthesized and purified, and thus will not have to be 
optimized. On the other hand, the use of a labeled primer results in incorporation of only 
a single molecule of the fluorochrome per DNA chain, whereas incorporating fluorescent 
bases into the growing or completed DNA chains during or after PCR amplification can 
provide DNA chains with multiple molecules of fluorochrome per chain, thus increasing 
the signal level and the sensitivity of detection substantially, e.g., as much as ten-fold. 
However, one cannot rely blindly on incorporation of flourescent bases during PCR to 
raise sensitivity, because the thermostable polymerases have a low affinity for most 
labeled base analogs and thus, the labeling process may not be as efficient as one would 
desire. Thus, applicant further contemplates embodiments wherein the amplified nucleic 
acids are chemically labeled after amplification to optimize the process for a given array 
with specifically-targeted organisms. 

[0045] The last issue to consider in regards to fluorescent labeling is which 

fluorochromes to use. The fluorochromes currently available can be divided roughly into 
two categories: those that fluoresce in the visible wavelengths of the spectrum and those 
that fluoresce in the near-infrared (NIR) region. The type of fluorochrome that is used 
will dictate the type of detection system that is employed to read the array. The signal to 
noise ratio for the NIR region of the spectrum is highly favorable and can offer detection 
limits that cannot be achieved in the visible region. However, relatively few IR 
fluorochromes are currently available and, due to their size and chemical properties, these 
may be incorporated even more poorly than visible fluorochromes during 
PCR. Presently, although prototype NIR array detection instruments are available for 
development purposes, NIR detection systems are not on the market, and the prototypes 
that are available detect larger spot sizes which would require a larger size testing matrix, 
array 50. However, the range of available probes and equipment may grow quickly, and 
applicant therefore contemplates selecting or developing suitable fluorescence labeling 
based on both visible and near-IR fluorochromes for prototype implementations of the 
system to evaluate which will provide the most efficient testing system. Thus, applicant 
contemplates simultaneously building or selecting appropriate materials for use in each of 
these labeling/detection technologies. 
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Probe Matrix and Array Design 

[0046] The probe matrix 50 shown in Figure 1 is laid out such that each different 

row contains six different probe addresses for a single organism. Each probe address 
contains several thousand molecules of a particular probe. Larger arrays of this type may 
be fabricated with hundreds of rows and columns of probe addresses. The organization 
of the addresses on the array can be such that the probes for a single species are in a 
specific pattern or they may be dispersed randomly among the matrix of addresses. A 
single probe may also be deposited at multiple locations to facilitate statistical analyses of 
the data. Several different methods may be employed for arraying probes onto the solid 
matrix. These approaches fall into two different categories: those based on synthesizing 
probes onto the surfaces in situ and those which involve deposition of pre-made probes 
onto a surface. 

[0047] Methods available for depositing or spotting pre-made probes onto 

surfaces allow one to rely on a broad range of commercially available probes. These 
spotting methods fall into five different categories. These include those based on 
capillary action to lift and expel samples, those based on electrochemical reactions to 
focus charged probes to specific addresses on the testing surface, those based on ink 
jetting technology to spritz samples onto surfaces, those based on precision 
micropipetting, and the quill-based technologies. Electrochemical devices currently 
produce only small arrays. Current arrays of this type have a size of approximately 100 
spots and thus could produce only limited distributions that would not embrace the fixll 
range of value-added potential discussed above for applying the testing array output 
distribution. Currently, several systems involving the use of capillary action, 
micropipetting, and quills are on the market. The selection of technology for an array 
system may take into account the advantages and disadvantages of the different array- 
making systems to effectively fabricate prototype or commercial embodiments of 
applicant's multispecies arrays, and these may differ for different sets of target 
organisms, or intended applications. Because critical criteria such as compatibility with 
the materials or processes of a given technology may not have been explored for some of 
the target species or materials contemplated herein, preferably some model arrays are 
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first fabricated by the intended user with the equipment or processes of select vendors, 
and evaluated to determine the most suitable system. 

[0048] The array size, and provision of multiple probes per targeted organism, is 

an advantageous feature of the invention. With an array of one thousand addresses, one 
hundred different microorganisms can be targeted with ten different probes per 
organism. Detection of any one of the organisms being evaluated would therefore be 
based on ten datapoints. Having multiple datapoints increases the specificity and 
statistical reliability of the test, and allows strain-specific as well as species detection to 
be implemented. Moreover, it also provides several data points from which to derive 
quantitative information. It also allows applicant to design a single array, but to vary 
which organisms are tested for and which target DNA sequences from a given species are 
be used for detection. 

[0049] The species one tests for may differ among different types of food 

matrices, but one general approach for a basic embodiment of the invention is to have 
addresses for coliform organisms and other enteric bacteria, and addresses for known 
foodborne pathogens, spoilage organisms, and commensal flora. Genes that can provide 
a molecular signature of each species or strain are preferably used. This may include 
groups of genes that are found among each of the species, but with probes designed to 
detect regions of the genes that are specific to each species. A second group of genes to 
be targeted are genes that provide specific signatures of each species, such as virulence 
genes for pathogens, and genes encoding degradative enzymes for spoilage organisms. 
Genes that encode enzymes of specialized biochemical pathways may also be used as 
diagnostic markers or indications for each different species. In addition, the probe may 
also be configured to detect gene sequences for which the function is not known, but 
which amplify well and are highly specific to the targeted species. 

[0050] In addition to discrimination at the species level, specific probes may be 

used for discriminating microorganisms at the sub-species level. For example, genes 
encoding specific virulence factors could be used to discriminate different populations of 
pathogenic strains of E. coli from non-pathogenic strains that might normally reside in 
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samples of some types of food. As another example, genes specific for biochemical 
pathways found among sub-species of fermentation organisms may be used to determine 
which species of lactic acid bacteria were present in raw vegetables that were intended 
for a fermentation process. 

[0051] Thus, applicant contemplates that multispecies arrays of the invention may 

be generally directed to a palette of generally relevant species, or be tailored to a more 
specific food application and directed to the different microbial species and the different 
types of target genes that are relevant to that application. Thus, the invention may 
employ product-specific arrays, as well as generic arrays that are useful for a wide variety 
of samples. 

Fluorescence Detection and Image Analysis 

[0052] After the fluorescently-labeled PCR amplification products from the 

original microbial population have been hybridized to their respective addresses on the 
multispecies array, the hybridization events are detected by image analysis of the pattern 
of fluorescence. As illustrated in Figure 1, the addresses where fluorescence occurs in 
the array then provide a signature of each species present in the sample, and the 
collection of fluorescent addresses provides a population distribution of the species 
present in the samples. This is read out by an image analysis system which may apply 
detection based on fluorescence signals emitted in the visible or Near-IR range, or a 
customized reader may be built to detect both spectral regions. Which region of the 
spectrum is chosen may be based on the behavior (sensitivity and specificity) of the 
ftuorochromes that are used to label the PCR amplification products. As noted above, 
initially applicant contemplates the use of commercially available label material with 
fluorescence in the visible band. 

[0053] In addition to simply creating an image of the fluorescence intensity of the 

addresses on the array, sampling software converts the output to a user-friendly format, 
such as a simple bar graph representation of the fluorescence intensity of the species 
associated with array addresses, which may be normalized relative to a baseline control. 
Such an output allows an educated analyst to evaluate the entire array for both 
presence/absence and relative abundance of target organisms in the original sample based 
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on the intensity of the fluorescence. The software may also quantify and compare the test 
readings internally, and provide an output highlighting which organisms are present, their 
estimated abundance and explanatory notes or highlights regarding warning thresholds, 
unusual or out-of-limit results or the like. 

Data Mining and Correlation 

[0054] In accordance with a further aspect of the invention, the processor is 

configured to mine, correlate, and screen data so as to detect information or develop 
detailed knowledge applicable to the species, to the processing and to the food products 
being tested. Thus, in a mining mode, for example, the user may compare the different 
parameters of the sample with similar samples in the database to determine where it lies 
in larger context or world of possible or desired conditions or results. This mode 
preferably employs a relational database that links the microbial population landscape to 
other parameters of the food sample. Thus, the user can evaluate the sample in terms of 
the average distributions of species relative to the database. Action levels may then be 
set based on deviant distributions rather than on arbitrary values or on a regulatory 
threshold for a single species or class. 

[0055] One method for conducting such an analysis is by superimposition of the 

population profile detected by the array onto a landscape of parameters or sentinel 
measurements. These parameters may include data parameters provided by 
non-bacteriological detectors of process conditions, such as pH, starch or sugar content, 
measure of volatile components, nitrogen content, or fat content. Comparison of the 
sample profile on this landscape with the normal distribution is then applied to provide 
predictive information on the quality of the tested sample. 

[0056] Thus, as shown in Figure 3, the processor 30 may receive 

keyboard-entered or bar-coded data or direct input of data from parameter sensors in the 
process line to indicate the various processing parameters for the lot being tested. Inputs 
may also be provided from other sources, such as keyboard or bar code entry of data from 
invoices and inventory slips, to indicate the source or age of each ingredient, and any 
special conditions such as early or late harvest, dry or wet growth season or other factors 
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that may either aid in understanding the array reader distribution test result, or may be 
desirable to monitor or change in view of the array results. Thus, for example, if one 
source of raw produce is found to supply produce with an unusually high level of an 
adverse species, the correlation processor may identify this condition with the source and 
allow appropriate actions to be taken. Similarly, extrinsic data on taste, color or other 
quality checks may be entered in the database, and correlated with any of the array results 
or data fields to provide predictive knowledge for process control, or to guide the 
addition of new species to the test array. Thus, for example, if a particular microbial 
population profile is found to correlate with highly desirable flavor characteristics, such 
as would be determined by a taste panel, the processor could discover the correlation 
between the sensory score for that particular flavor characteristic and the population 
profile. 

System Implementation 

[0057] For building initial multispecies arrays of the present invention, two 

fundamental approaches may be used. One can simply design the arrays and have them 
fabricated commercially by a vendor of such services. Alternatively, the arrays may be 
produced within the research or food-processing entity itself. Although the former 
approach may seem advantageous, convenient and inexpensive during the initial stages of 
assay implementation, applicant's invention contemplates on ongoing process of array 
development or evolution, and it is preferred in the long run to have an array maker 
available through the food processing corporation or other entity that is actively 
controlling the preparation and processing steps. This facilitates making alterations to the 
arrays to address transient or evolving concerns, and simplifies optimization and 
evolution of the array and the array-based testing methods. It also allows the processor to 
exercise enhanced quality control, and will conveniently allow production of the arrays to 
be coordinated with processing and increased when it is desired to increase the output 
potential of a process or of the array maker. Since the information-mining operation of 
the system is expected to result in continual creation of new tests or optimizations, 
in-house array making will greatly speed up the incorporation or evaluation of ongoing 
developments. The actual array fabrication process may employ any of the several array 
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maker types currently on the market. In general, the range of utility of a given 
array-making device, its component costs and the operating times required for its various 
steps as well as the range of adjustable parameters may vary widely and thus, affect its 
utility for the purposes described herein. Thus, any array-making system that offers 
flexibility and economy to perform a range of preparation, amplification and labeling 
tasks outlined above would be suitable for the invention. 

Probe Configuration 

[0058] The initial selection of species and probes for a test array can be done by 

computation. This may be done by selecting an appropriate number of target organisms 
or species that are already known to be of interest, and then searching a database such as 
GenBank to obtain a list of gene sequences that have been identified for each of the 
species. The gene sequences may be cross-checked to identify which ones are specific to 
the organism, or ones which are common to several, or a class of organisms may be used. 
Among the gene sequences so identified, one may select particular ones that encode a 
toxin, or are important for virulence, or are otherwise associated with a known and 
important trait such as a biosynthetic pathway. Probes are then made for the selected 
sequences for each species to fill in the array and to carry out the full processes of 
amplification, labeling and array detection. The probes can be made by PCR 
amplification of the genes from a representative target organism or they may be synthetic, 
prepared by chemical synthesis of a segment of the gene of interest. 

[0059] For example, on a model array, the applicant has fabricated several classes 

of genes from different organisms that singly and collectively are diagnostic of those 
organisms. Toxin-encoding genes that have been fabricated onto the array are the 
hemolysin BL (hblA, hblB) gene from Bacillus cereus, Listeriolysin O (hlyA) gene from 
Listeria monocytogenes, enterohemolysin (ehxA) and shiga toxin (stxA2) from genes 
E. coli 0157:H7, enterotoxins A, G, D, I, (entA, entG, entD, entl) and toxic shock toxin 
(tsst-1) genes from Staphylococcus aureus, and the shiga toxin (stxA) and 
vacuole-forming toxin (vacB) genes from Shigella dysenteriae. Other genes that 
participate in various aspects of virulence or pathogenicity that have been fabricated onto 
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the array include ipa-P60, ItrA, mpl, inlB, inlA, plcA, and plcB from Listeria 
monocytogenes, yopD, yopB, yopH y yopE, virF, yopN f yopP and yopM from Yersinia 
enterocolotica, sarA, from Staphylococcus aureus, icsB t ipaC, ipaB f and virF from 
shigella dysenteriae, sopE, pagQ invJ, spiA, prgK, and invH from Salmonella enterica, 
and hlyA t toxB f espP f eaeA t etpC, and escT bom E. coli 0157:H7. Other genes on the 
array comprise a broad category of genes encoding components of specific biochemical 
pathways that are diagnostic of particular organisms. Sporulation genes are diagnostic of 
spore forming species such as Bacillus subtilis and include the spoIIGA, sigB, spoOA and 
spoIIAA that have been fabricated onto the array. Genes encoding unique biochemical 
pathways have also been fabricated onto the array including the beta-glucuronidase 
(uidA) and beta-galactosidase (lacZ) genes from E. colL Lastly, genes that encode 
common functions but have unique structures for diagnostic purposes have also been 
included on the array. These include the dacF gene of Bacillus subtilis that participate in 
cell wall synthesis, the pglA, pglD, pglF, and pglB genes of Campylobacter jejuni that are 
involved in protein glycosylation (and potentially virulence), the cipA gene encoding a 
phosphatase and flaB gene encoding the flagellin subunits of the flagellum of 
Campylobacter jejuni, the aspartyl-tRNA and threonyl-tRNA encoding genes (asp-tRNA 
and thr-tRNA), the ribosomal RNA subunit gene (23SrRNA), the secondary sigma 
subunit sigma B encoding gene (rpoS), and the glucose dehydrogenase gene (gdH) of 
E. coli, the nuclease encoding gene of Staphylococcus aureus, and fur gene of Shigella 
dysenteriae that is involved in iron uptake, and the traJ and the traK genes encoding 
plasmid transfer functions in Salmonella enterica. 

[0060] Once the probes have been chosen and prepared, they are then deposited 

onto the array in one of several possible patterns. A simple pattern would position all of 
the probes for a specific species within one spatial location on the array, for example 
adjacent to one another. Another pattern would position duplicate probes at two or more 
different. Still another pattern would place duplicate probes at random positions on the 
array. 

[0061] Initially, the primary genes of interest will be those which are important 

for virulence in specific species. However, once a database of test results and related 
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process or ingredient histories is established second-generation research or detection 
probes may be fabricated to extend the range of useful results. The process for forming 
such second-generation arrays will be somewhat different. For example, one may get a 
listing of all gene sequences in GenBank for a particular organism, then cross check the 
sequences to determine which are unique to the subject species or strain. As before, one 
designs appropriate probes for amplifying and detecting the characteristic sequences, and 
the resulting array and process are then used under a range of variation to quickly 
determine the specific sequences that provide greatest sensitivity, provide compatibility 
with a PCR cocktail, and other properties for optimizing array configuration. The 
sequences need not code for virulence or, indeed, need not have any recognized function. 
The ability to use any sequences listed in GenBank for the target organism allows one a 
large choice of suitable gene sequences for identifying the target organism while 
achieving simultaneous compatibility with common processing cocktails for multiple 
species. Alternatively and additionally, diagnostic sequences may be identified 
experimentally. One approach would be through design and implementation of a single 
species array having addresses that represent the entire genome. This array is then 
probed with the entire, genomes of closely related species to find addresses to which no 
corresponding target sequence can be found to hybridize from other species. 

EXAMPLE 1 

[0062] To examine sensitivity and specificity of a multispecies array, a model 

array was fabricated to detect seven different species of pathogenic microorganisms, a 
spoilage organism, and one indicator organism. These were the pathogens Listeria 
monocytogenes, Staphylococcus aureus, Escherichia coli 0157:H7, Shigella dysentariae 
Yersinia enterocolitica, Campylobacter jejuni, and Salmonella enterica Var. 
typhimurium. Generic E. coli was taken as an indicator organism, and the selected 
spoilage organism was Bacillus subtilis. Ten diagnostic genes were chosen from each of 
these organisms, amplified by PCR, and the products deposited on an array on 
poly-L-lysine coated glass slides. Each array included duplicate addresses that contain 
probes for each of the target genes, and six different control addresses that contained 
varying amounts of DNA encoding the green fluorescent protein used for labeling, which 
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is not known to be carried by natural populations of bacteria. Performance of the array 
was then examined by applying PCR amplification products from known concentrations 
of DNA purified from Bacillus subtilis, (1 A-1R) Salmonella entericia, (8A-8R) Listeria 
monocytogenes (1S-1X, 3S-3X, 4S-4X) and£. coli 0157:H7 (5S-5X, 6S-6X, and 
8S-8X). The amplified target PCR products were labeled by incorporation of 
fluorochromes into the growing DNA chains during each round of enzymatic DNA 
polymerization during the PCR reaction. Bacillus subtilis and E. coli 0157:H7 
amplification products were labeled on one channel (green) while Listeria 
monocytogenes and Salmonella entericia were labeled on a second channel (red). The 
labeling method was performed on dilutions of the target DNA ranging from 10' 2 to 10~ 7 , 
which were calculated to represent a range of target cells from 10^ to 10 l . 

[0063] After chemically blocking sites on the slide surface where no probe 

molecules exist (i.e. between the addresses and on the edges), the labeled products from 
the different dilutions of target DNA were hybridized to the array overnight in 
Tris-sodium citrate-SDS buffer at 62°C. The slides were then washed three times in 
sodium citrate buffer to remove excess labeled target DNA molecules and were scanned 
in a commercially available scanner. The relative fluorescence intensity above 
background was recorded. 

[0064] Results from several of the arrays are shown in Fig. 4 for sample levels 

corresponding to 10, 100 and 1000 cells. The specificity of array detection was 
demonstrated by the absence of any fluorescent signal above background at any probe 
address other than that of the target organism. In most of the diluted samples, significant 
fluorescence was observed at each address for the target organisms in the experiment. At 
the highest dilutions, fluorescence intensity became undetectable at several addresses, 
indicating that several specific ones of the target genes were not amplified by PCR with 
equal efficiency. Nonetheless, fluorescence was still measurable at most addresses 
specific for a given target organism at target DNA concentrations representing only ten 
cells of that species. 
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EXAMPLE 2 

[0065] The purpose of this study was to experimentally identify species-specific 

probes. To identify segments of the chromosome that are unique (diagnostic) to the 
different species of foodborne pathogenic bacteria, genome libraries were prepared from 
randomly sheared DNAs of each of the species of interest and the libraries were probed 
with fluorescently labeled DNA from each of the species of interest. Cloned segments 
were chosen based on their ability to hybridize to DNA from the species from which they 
originated, but not other species. 

[0066] The libraries were prepared from strains of Bacillus cereus, 

Campylobacter jejuni, Escherchia coli 0157:H7, Listeria monocytogenes, Salmonella 
enterica, Shigella dysenteriae, Staphylococcus aureus, and Yersinia enterocolitica. A 
total of 1,000 clones from each library were chosen and amplified individually by PCR 
using T3 and T7 primers that flank the insertion site of the cloning vector. Amplified 
DNA was then printed onto glass slides to fabricate microarrays containing segments 
amplified from each of the libraries. The printed specific genes were now called 
"Probes." 

[0067] To find the cloned DNA segments from each of the species that 

specifically hybridize to labeled DNA from the species of origin, the microarrays were 
hybridized with chromosomal DNA extracts from each of the different species. In each 
experiment, DNA from seven of the eight species was labeled with one fluorescent dye 
(Cy5) and the eighth was labeled with a second fluorescent dye (Cy3). The labeled 
DNAs were then mixed and hybridized to the array. As shown in Figure 5, fluorescence 
analysis can easily identify the cloned segments which hybridize only to the species 
labeled on the Cy3 channel. The false colored image is comprised of superimposed 
images from fluorescence scanning on the CY5 (green) and CY3 (red) channels. In the 
combined image, probes on the microarray that hybridize to the reference genome (Cy3 
channel) and one or more of the test genomes (Cy5 channel) are colored yellow. Those 
segments hybridizing exclusively to one or more of the test genomes only appear green 
and those hybridizing exclusively to the reference genome appear red. In Figure 5, the 
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reference genome was Listeria monocytogenes while the test genomes were from the 
other seven species. 

EXAMPLE 3 

[0068] The purpose of the study was to test the multispecies array with ground 

beef spiked with Listeria. Once species-specific segments were identified from each of 
the eight species of interest, an array was fabricated with species-specific (diagnostic) 
genome segments from each species. The array was fabricated with the diagnostic 
genome segments from a given species placed adjacent to one another in rows across the 
array. 

[0069] To test the efficacy of the array to detect contaminating bacteria from a 

food sample, known quantities of pure bacterial species were introduced into a food 
matrix. The food was then sampled and the sample subjected to enrichment by 
microbiological culture. For the experiment described herein, 90 ml Listeria selective 
enrichment medium (UVMI, Oxoid) was used. The food matrix was ground beef. A 
total of 10 g of the ground beef was spiked with 1000 Listeria cells. The contaminated 
beef was then transferred to a stomacher bag, homogenized for 2 minutes, and allowed to 
incubate for 8 hr at 30°C. After incubation, 1 ml of the culture was transferred to 10 ml 
of Listeria secondary selective enrichment medium (UVMII, Oxoid), and incubated at 
30°Cfor 16 hr. 

[0070] DNA was extracted from the secondary enrichment culture by harvesting 

the cells by centrifugation followed by disruption in 1 ml of buffer containing 200/xg/ml 
lysostaphin and 10 mg/ml lysozyme, add 8 /xl 100 mg/ml RNase A stock solution. The 
cell were incubated in the buffer at 37°C for at least 30 min before addition of 50 ^1 of 
proteinase K (20 mg/ml in ddH 2 0) and additional incubation at 37 °C for 30 min (until 
no beef particulates are visible). At that point, 0.35 ml of Buffer B2 was added, mixed by 
inversion several times, and incubated at 50 °C for 30 min. 

[0071] The lysed cell suspension was then applied to a Qiagen genomic tip 20/G 

column equilibrated with 2 ml Buffer QBT. After allowing the lysate to flow through the 
column, the bound DNA was washed with 1 ml of Buffer QC for three times and finally 
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the DNA was eluted with 0.8 ml of pre-warmed (50°C) buffer QF. The DNA was 
precipitated with 600 ul of isopropanol. The precipitated DNA was then collected by 
centrifugation and dissolved in 10 /il ddH 2 0. The entire DNA sample was then used for 
labeling on the CY5 channel. For control purposes, 100 ng of DNA from each of the 
species on the array was also labeled on the CY3 channel and mixed with the labeled 
DNA from the enrichment. The mixed labeling reactions were then hybridized to the 
multispecies array. The resulting image is shown in Figure 6. The image, derived from 
superimposition of the individual images from CY5 and CY3 channels, is artificially 
colored. Array address with probes hybridizing to both the control DNA and the sample 
DNA are colored yellow, those hybridizing to control only are green. Only array 
addresses from the Listeria monocytogenes -specific probes hybridized to the sample. 

DISCUSSION 

[0072] The examples confirmed that the multispecies array detects and quantifies 

the presence of pertinent bacterial species in the targeted range, in many cases with great 
sensitivity, and that labeling may be effectively performed either during or after 
amplification. For the organisms where the sensitivity was lower, a preliminary culture 
step may be used to selectively increase population of those organisms, or the 
amplification process may be adjusted to augment this component. Alternatively, 
separate arrays could be provided for different sets of organisms, with the processing 
cocktails for each one optimized for its subgroup of species. 

[0073] As described above, systems of the invention include three major 

components: the sample preparation and DNA amplification system, the array itself, and 
the image reading or analysis system/software. Applicant contemplates that the sample 
preparation and DNA amplification system will in general be developed separately and 
combined for final optimization. Thus, one first identifies a set of preparation steps to 
carry out fluorescent PCR amplification of target genes from a selected set of microbial 
populations, examining combinatorial cocktails of primers directed against short 
oligonucleotides that are unique to or are overrepresented in the genomes of the target 
model organisms, and identifies cocktails containing sets of primers that will allow 
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amplification of each gene target. These primers may be tested initially against a model 
array having only a few target species, using a commercial visible light scanner or an 
available NIR scanner, as applicable, for image analysis. At this stage, the performance 
of different fluorescent PCR protocols are evaluated for comparative purposes. During 
this initial stage, one may also examine the sensitivity and linearity of the array-based 
testing approach once the fluorescent PCR protocols have been worked out with a model 
array for analysis. As noted in the example above, adequate sensitivity was observed for 
most of the initially-selected gene sequences even without optimization. 

[0074] A preferred sample processing instrument includes a sample chamber with 

appropriate housings and reagent lines into which a sample of organisms is lysed and in 
which its DNA adsorbed to a fixed matrix. Once the DNA is immobilized on a solid 
matrix in the chamber, reagents for washing the adsorbed nucleic acids and for 
conducting fluorescent PCR amplification are then pumped into the sample chamber in a 
sequence coordinated with heating cycles. The fluorescent PCR products are then eluted 
or pumped from the chamber into a chamber housing the array. The fluidics 
mechanisms, column-housing chamber, lines for reagent streams, and heating-cooling 
jackets of the instrument for PCR may be similar to those of known micro- or 
meso-fluidic devices, and are operated under program control to perform both the nucleic 
acid extraction and the PCR amplification cycling. 

[0075] Thus, by utilizing a flow cell in which the nucleic acids from the sample 

can be adsorbed, the system provides efficient and user-friendly sample processing 
system for quantitatively extracting nucleic acids from the microorganisms for 
downstream processing in the system. Capillary ports are used to pump reagents into and 
through the cell to wash the nucleic acids and perform the PCR reactions. "Tuning" of 
the PCR amplification and labeling system may be accomplished by measuring the 
efficiency of the extraction fluidics during initial development for performance 
evaluation of the PCR reaction in the sample cartridge on the model array, for example 
by using known quantities of model microorganisms such as E. coli and Listeria 
monocytogenes in the sample preparation system, and processing conditions are altered, 
or poorly-amplified sequences are replaced with other sequences, to assure that effective 
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levels of detection are achieved for the full target range. The results reported above for 
the EXAMPLE indicate feasibility and operability of the simultaneous-array system for 
pure cultures. 

[0076] In order to better adapt the preparation and amplification protocol to 

quantitatively assess the profile of the microbial community present in a specific class of 
food samples, this parameter is preferably calibrated early in development, and is 
evaluated at each critical transition step. A first measurement point may be taken after a 
PCR methodology has been identified, but not necessarily optimized. At this stage, 
performance of a small model array and manual sample preparation may be assessed with 
spiked food samples, and may be compared both to results obtained with conventional 
methods and to the values obtained with pure cultures. Changes to the PCR protocol may 
be implemented to improve efficiency of, or to replace, ineffective portions of the 
process. During the transition to successively larger arrays, the efficiency, sensitivity and 
linearity are similarly evaluated and calibrated. Since one can not possibly test for the 
same number of species by conventional methods as can large arrays, results from large 
prototype arrays may be compared to analysis of a few species by conventional methods. 
This approach identifies performance characteristics such as cross-reactivity and 
background from food matrix compounds, allowing one to evaluate and correct for these 
effects, in addition to determining sensitivity. 

[0077] Figure 3 illustrates both the method and the overall system or equipment 

for carrying out the method of the present invention, as applied to testing food products. 
As shown in Figure 3, a food sample 1 is first subjected to a preparation and culture step 
10. Culture may be carried out manually, or in an automated assembly which basically 
conditions and places appropriate samples of the food into one or more culture media and 
incubates or otherwise provides culture conditions appropriate to grow the targeted 
species or set of species to a suitable population sample size. This may, for example, 
grow to 50 to 5000 cells or more, and may, for zero tolerance organisms, be optimized to 
achieve suitable detection levels with as few as ten cells in the sample. The population 
cultures are then washed, concentrated, lysed or otherwise processed to prepare them as 
one or more sample DNA aliquots, i.e. groupings of process compatible species, e.g., 
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nucleic acid segments, fragments and components from the species or subgroups of the 
species, which are loaded in one or more preparation tubes or columns 21 of a processor 
20. 

[0078] As noted above the processing at this stage may be conducted by semi- 

automated or manual techniques, but is preferably carried out in an automated processor 
20 to effect simultaneous PCR amplification of all the target components (or all of one 
set of PCR-compatible process targets) simultaneously in the tube or column 21 . In 
general, this will be a micro-or-meso-fluidic processing system in which various valves 
connected to solvent, buffer, reagent and primer supplies, as well as a heater 22, are 
controlled by a microprocessor to carry out a sequence of washing, blocking or 
denaturing, attachment of primers, and a number of PCR incubation cycles to amplify 
and label a set of several target materials simultaneously by polymerase chain reaction 
and thereby produce effectively detectable quantities of the target nucleic acid sequences 
present in the sample. The amplified and labeled products are then output to the array 
hybridization system where the predefined set of probes are allowed to form hydrogen 
bonds on the corresponding target positions of the prepared array 50, which is fabricated 
with a matrix of specialized gene targets at predetermined matrix positions on its surface. 

[0079] In its most general form, applicant contemplates that several different PCR 

processing regimens may be carried out in parallel, but with different materials and 
amplification regimens, in separate columns or tubes 21 of the apparatus in order to best 
amplify the full set of target materials to which the probes of the array 50 are directed. 
Thus, the optimization processes described above need not result in a single culture 
cocktail or a single PCR amplification process which is effective to amplify all target 
materials, but may identify several sets of processing conditions and materials that 
collectively prepare the sample in one or more sub-groups for testing with all probes of 
the array 50. To this end, the PCR processor 20 may accommodate several tubes 21 
simultaneously fed by valved tubing from separate sources of primers and having 
separately controlled incubation cycles and solvent/reagent mixes. Similarly, the output 
lines from each of the chambers may feed in parallel to a robot assembly having, for 
example, a number of parallel pipette spotters individually fed by the different PCR 
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processing chambers and spotted onto different predetermined subsets of the addresses of 
a single array 50 to test for all the target species. Preferably, however, the process is 
optimized to process all components together in a single column. 

[0080] In addition to PCR amplification, the system can also accommodate 

labeling of the extracted DNA using non-PCR based methods such as random priming or 
direct chemical labeling of extracted DNA. In these cases, the labeling step would be 
performed in one or more tubes 21 that have separate valves for reagent mixes. The 
tubes may be heated and cooled to allow for hydrogen bond formation between synthetic 
primers (such as random primer) and the template DNA. 

[0081] After amplification and/or labeling and deposit on the array, the labeled 

DNA is allowed to hybridize to the probes and the array 50 is then washed and passes to 
an array reader 60 which, as noted above, preferably is a visible light or near-IR reader 
that detects the presence and/or intensity label at each site of the array to identify the 
presence of bound target material of each type. The array reader 60 provides its output to 
a microprocessor 70, which may also be programmed to control the operating parameters 
of the PCR amplification module 20, as well as to record and track the label detection 
outputs from the reader 60. In general, the basic hardware and equipment of the PCR 
amplification module 20 may include, or consist largely or entirely of a commercially 
available or modified microfluidics processing apparatus, having its own microcontroller, 
and the microprocessor 70 interfaces with that controller to provide the specific operating 
parameters for specific groups of systems involving simultaneous amplification of the 
multiple target species of foodborne bacteria as determined by the optimization process 
outlined above. 

[0082] Advantageously, the outputs of the array reader 60 include not simply 

detection at the species level, but distribution of species, as well as genomic variations. 
Collectively, one complete output set is referred to herein as a "distribution". 

[0083] In addition to its functions of controlling the process operation and storing 

the detected distributions, microprocessor 70 preferably includes or connects to a system 
having a database which may include prior test results, as well as histories, extrinsic 
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parameters and prior correlations or links to any of the foregoing data sets. The histories 
may include information such as the source of the food components of the product being 
tested, results of chemical or physical profiles, sensory analyses, and even history such as 
transportation or shipping conditions. The array outputs may also be corrected, or linked 
to or grouped based on certain extrinsic and intrinsic properties of the food such as color, 
clarity, redox potential, temperature, pH, water activity or ingredient age, and the 
distributions may be output together with interpretive or comparison data. 

[0084] In one embodiment, the comparison data includes extrinsic measures of 

previous test results, such as normative bacteria distributions measured for the same 
products in different batches, analytic measures such as standard deviations of the 
distributions of bacterial flora, and other such information. The output data may further 
include other forms of previously-developed knowledge such as indications of shortened 
shelf-life, indications of specific conditions (such as off-flavor or color) or indications 
(such as a prior condition detected during recall following batch release) that may require 
closer analysis or testing with a different or more inclusive array, or may call for a 
specific, separate, manual test. 

[0085] In a preferred implementation of the system, the microprocessor contains a 

mining and learning database which operates on the output of the array reader 60 to 
generate or detect relevant higher level information by correlating or comparing the 
reader output to selected data from the data module, including prior test results, histories 
and such previously developed correlations. For example, the microprocessor may 
compare the array reader output to a normative reading, detect an excess of two species 
or the atypical presence of several other species, then perform a correlation of this 
sub-group of information to determine, for example, that such a distribution indicates a 
maladjusted process equipment parameter, or is associated with likely contamination by 
yet a fourth organism that has not been tested for. Preferably the data processor 70 also 
includes an output module and display to output a higher level indication such as a listing 
of detected species or a population landscape, and to output specific notes on any adverse 
consequences or process warning indications associated with the detected distribution. 
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[0086] The multispecies arrays of the present invention may be configured for 

different foods, classes of foods, or processes. Thus, when a non-meat food is being 
tested, an array that does not include probes specific to meat bacteria may be used. For 
certain starches and tubers, multiple probes for organisms specific to that class of food 
materials may be included. Alternatively, the applicant contemplates that after a suitable 
process of development and checking for cross-reactivities, amplification and extraction 
efficiencies, a single large array may be fabricated to detect such a great number of 
species that it may be used for all or most foods. In that case, for any given food, the 
processor 20 and robotic spotter may be programmed to apply only a subset of their 
normal operations, for example, to implement only the necessary culture steps and PCR, 
and to load only the necessary addresses of the array positions for the specific species and 
gene sequences applicable to the particular class of food sample being tested. 

[0087] In this manner the invention provides a food monitoring and detection 

system of enhanced information content and sensitivity which detects not only the 
presence of deleterious pathogens, but the preconditions, indications and associations that 
enable or follow from the presence of detected species distributions, and to identify 
indications of other organisms not specifically probed. It therefore permits the operator 
to adjust process conditions and prevent undesired occurrences, as well as to develop by 
data mining and correlation, a proprietary database directed to the proprietor's particular 
processes and foods for optimizing not only sanitation, but flavor, texture and other less 
well controlled, more transient or less understood aspects of food quality and food 
processing, as well as controlling extrinsic factors such as the sources of ingredients. 

[0088] As noted above, the invention includes an ordered multispecies array with 

probes for multiple characterizing sequences for each species. The ability to select from 
a vast number of characteristic sequences is expected to greatly facilitate the process of 
determining suitable sequences and species groupings for simultaneous PCR 
amplification. It also allows selection of sequences which may have been largely 
neglected in conventional bacteriological investigations because their function has not 
been determined, but which are advantageous in this context because of their efficient 
amplification and their unique identification with the target species. Thus, the arrays 
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with multi-sequence detection for each of multiple species, offer both high sensitivity and 
multi-species, or spectral, distribution output. 

[0089] In addition to food processing, this same system may be applied, by 

simply selecting a possibly different set of target species and sequences, to carry out 
environmental, epidemiological, or clinical testing. Thus, for example, rather than testing 
a food product, the system may be applied to test swabs taken from surfaces or equipment 
in a slaughterhouse, hospital, kitchen, processing plant or other workplace, or may be 
applied to samples or cultures taken from groundwater or effluent. Similarly, the species 
distributions maybe selected to assess, for example, specific medical tasks, e.g., to assess 
respiratory flora to better detect or diagnose the epidemiological patterns of influenza in 
population centers or to address the possible causative agents for particular presenting 
symptoms, such as testing CSF for meningitis-like presenting symptoms, or swabs for 
pneumonia-like presenting symptoms, in a clinic. 

[0090] In this regard, the test sequences need not be limited to bacterial species 

and gene sequences, but may include probe sequences for viral, fungal and parasitic 
organisms and may also include those pathogenic plants. For the different applications, 
the palette of species may be directed to product-colonizing species, environment or 
niche-colonizing species, or infectious or chronic mammal- colonizing species. For more 
specialized clinical applications, the palette may directed to efficiently perform a 
differential diagnosis of a specific presenting condition or syndrome, for which a 
recognized set of bacterial, viral or cellular DNA arrays have been selected to rule out or 
identify the presence of potential etiological factors or determinative agents. 

[0091] The invention being thus disclosed, further variations and modifications 

will occur to those skilled in the art, and all such modifications and variations are 
intended to be within the scope of the invention as described above and defined in the 
claims appended hereto, and their equivalents. 

[0092] What is claimed is: 
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